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Abstract. We construct the first examples of algorithmically complex finitely presented 
residually finite groups, the first examples of finitely presented residually finite groups 
with arbitrarily high (recursive) Dehn functions, and arbitrary large depth functions. The 
groups are solvable of class 3. We also prove that the universal theory of finite solvable of 
class 3 groups is undecidable. 
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1. Introduction 

1.1. The problem and previous approaches for a solution. It is well known that 
finitely presented residually finite groups are much simpler algorithmically than arbitrary 
finitely presented groups. For example, the word problem in every such group is decidable. 
Moreover the most "common" residually finite groups, the linear groups over fields, are 
algorithmically "tame": the word problem in any linear group is decidable in polynomial 
time and even log-space [35]. Dehn functions are important witnesses of the complexity 
of the word problem. Although it is well known that groups with solvable word problem 
can have very large Dehn functions [40 \ I58j . no examples of finitely presented residually 
finite groups with superexponential Dehn function are known. Thus one of the main open 
problems in this area is how large could the Dehn function of a residually finite finitely 
presented group be. The question was known since early 90s. It was open for so long 
because all known methods to construct algorithmically hard groups produced either non- 
residually finite groups or groups where the question about their residual finiteness is very 
difficult. Not much is known even for linear groups (Steve Gersten asked \19\ [20] if there 
exists a uniform upper bound for Dehn functions of linear groups). Let us briefly discuss 
the previous attempts to solve the problem and the reasons why these methods did not 
work. 

1.1.1. Method 1. Known groups with large Dehn functions. One could hope that some of 
the known finitely presented groups with very large Dehn function may turn out to be 
residually finite, which would shed some light on how to produce residually finite groups 
with even larger Dehn functions. Unfortunately, this is not the case, all these groups are 
non-residually finite. For example, the Dehn function of the one relator group Gn$\ = 
(a,b | b~ 1 a~ 1 bab~ 1 ab = a 2 } introduced by Baumslag in [2] is bigger than any iterated 
exponent (see Gersten [18]). Platonov [51] proved that it is equivalent to the function 
exp( log2n )(l), where exp( m )(x) is the function defined by exp(°)(n) = n and exp( fc+1 )(n) = 
exp(exp( fc )(n))). However, Gn^) is not residually finite (and in fact has very few finite 
quotients) [2]. Furthermore, the word problem in Gn^) is in polynomial time [42] . 

1.1.2. Method 2. Using subgroups with very large distortion. Consider a finitely presented 
group G and a "badly" distorted finitely generated subgroup H. Let T = (G,t \ t~ l ht = 
h (h 6 H)) be the HNN extension of G where the free letter t centralizes H. It was noticed 
by Bridson and Hafliger [lit Theorem 6. 20. Ill] that the Dehn function of T is at least as 
large as the distortion function of H in G. The following result puts some limitations on 
this method of constructing complicated residually finite groups. 

Lemma 1.1. If the group T is residually finite then H is closed in the pro- finite topology 
ofG. 

Proof. In the notation above, suppose T is residually finite and u € G\H. Then w = [u, t] ^ 
1 in T (by the standard properties of HNN extensions). Hence there exists a homomorphism 
<p from T onto a finite group T w such that <p(w) ^ 1. But this implies (j)(u) <j>(H) (since 
every element of (j)(H) commutes with <fi(t)). Hence there exists a normal subgroup of finite 
index N < G such that u does not belong to NH. In other words H must be closed in the 
pro-finite topology of G. □ 

Consider the following typical examples of residually finite groups G with highly distorted 
subgroups H. The first one is Wise's version [M] of Rips' construction [53] which for every 
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finitely presented group Q gives a finitely presented residually finite small cancellation group 
G with a short exact sequence 

l^N^G^Q^l 

where N finitely generated. It is easy to see that the distortion function of N in G is 
at least as bad as the Dehn function in Q, so choosing Q properly one can get a finitely 
presented residually finite group G with a highly distorted subgroup N. Now, the subgroup 
N is normal in the HNN extension T. So it is closed in the pro-finite topology of T only if 
Q = G/N is residually finite. By Lemma T can be residually finite only if Q is residually 
finite. In other words to construct a complicated finitely presented residually finite group 
T one has to have the initial group Q complicated, finitely presented and residually finite 
as well. 

The second example is the standard Mikhailova construction. In this case highly distorted 
subgroups of the direct product of two free groups F2 x F2 can be obtained as equalizers 
of two homomorphisms (f>\ : E\ — > M and fa '■ E2 — > M where E\ , E2 are finitely generated 
subgroups of F2 and M is finitely presented (see Sections 11.41 El below) . But by Remark 15.41 
below the equalizer is closed in the pro-finite topology only if M is residually finite. Thus as 
in the previous example, in order to construct an algorithmically complex residually finite 
finitely presented group using Mikhailova's construction and the HNN extensions as above, 
one needs to have already a finitely presented residually finite algorithmically complex group 
M. 

The third example is Cohen's construction of highly distorted subgroups employing the 
modular machines |12] , One can also prove that in that construction the subgroup will be 
pro-finitely closed only if the modular machine is very easy. 

One can also try to use the hydra groups [HI [10] to construct HNN extensions as above 
with Dehn functions bigger than any prescribed Ackermann function. The question of 
whether these groups are residually finite is open (most probably the answer is negative 
because the distorted subgroup should not be closed in the pro-finite topology, but this 
needs a proof). 

1.1.3. Method 3. Boone-Novikov constructions. One of the standard ways to produce algo- 
rithmically complicated groups is by simulating Turing machines using free constructions 
(HNN extensions and amalgamated products) which goes back to the seminal papers by 
Boone and Novikov (see, for example, |54j). There are currently many versions of that 
construction (for a recent survey see |57j). But in fact, it can be shown that for each known 
version of the proof of Boone-Novikov theorem using free constructions, even for easy Tur- 
ing machines the corresponding group is non-residually finite. This is, for instance, the 
main idea of the example in plj. Here is an even easier example. Let G = G(M) be a 
group constructed by any of these constructions. Then for every input word u of the Turing 
machine M there exists a word w = w(u) obtained by inserting some copies of u in u>(0), so 
that u is accepted by M if and only if w(u) = 1 in G. Now consider M that accepts a word 
a n if and only if n 7^ (that machine is actually one of the basic building blocks in [58J). 
Then w(a n ) = 1 in G if and only if n 7^ 0. Suppose that there exists a homomorphism 
onto a finite group H that separates w($) = w(a°) from 1. Then cf>{a) has finite order, say, 
s, in H. Therefore 4>(w(®)) = (p(w(a )) = 4>(w(a s )) = 4>{1) = 1, a contradiction. Hence 
G(M) is not residually finite. 
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1.1.4. Method 4- Residually finite groups obtained by free constructions. In general, the 
question about residual finiteness of free constructions is very difficult. Currently there 
are only two large classes of groups where the question was settled: these are ascending 
HNN extensions of free groups [7] and certain groups acting "nicely" on CAT(0)-cubical 
complexes including small cancelation groups (see the recent work of Wise, for example, [25] 
and references therein). All these groups have easy word problem and uniformly bounded 
Dehn functions. The reason for the lack of more examples is that groups obtained by free 
constructions from "nice" groups contain a lot of extra elements and it is not at all clear 
how to separate these elements from 1 by homomorphisms onto finite groups. In the two 
cases when it could be done, it was possible to reformulate the problem in the language of 
algebraic geometry and geometric topology, respectively. 

1.1.5. Method 5. Other groups with complicated word problem. There are several other 
constructions of groups with complicated word problem but each of these also almost always 
produce non-residually finite groups. For example, the group in [38] is based on the R. 
Thompson group V which is infinite and simple (hence not residually finite). 

1.1.6. The main results of the paper. In this paper, we construct finitely presented residu- 
ally finite groups with arbitrary complex word problem, and also easy word problem but 
arbitrary high (of course recursive) Dehn function, and arbitrary high depth function. We 
also give applications of these results to the questions about solvability of the universal 
theory of finite solvable groups. 

Our constructions are based on simulating Minsky machines in groups. Surprisingly, 
the algebraic structure of the group not only depends on the construction itself, but also 
heavily depends on the computational properties of the machines we simulate: for the group 
to be residually finite the Minsky machines should be sym-universally halting, that is their 
transition graphs are vertex-disjoint unions of finite trees. 

1.1.7. What next? We expect the approach used in this paper to be useful in solving other 
problems that are still open. For example, the residually finite version of the Higman 
embedding theorem would be very desirable. It is known [39} fl5| l2~T| [33] that a finitely 
generated recursively presented residually finite group may have undecidable word problem, 
and hence cannot be embedded into a finitely presented residually finite group. But it is 
not known whether every unsolvability of the word problem is the only obstacle for such an 
embedding. Thus it would be very interesting to find out whether every finitely generated 
residually finite group with solvable word problem embeds into a finitely presented residually 
finite group. Note that usually a version of Boone-Novikov theorem precedes a version of 
Higman theorem, hence we can consider this paper as a step toward the residually finite 
version of Higman's theorem. 

1.2. The "yes" and "no" parts of the McKinsey algorithm. One of the initial mo- 
tivations for studying residually finite groups, semigroups and other algebraic structures 
was McKinsey's algorithm solving the word problem in finitely presented residually finite 
algebraic structures. Even though the algorithm is well known and classical, surprisingly 
little is known about its complexity. In this paper, we fill this gap. 

Let G = {X; R) be a residually finite finitely presented algebraic structure of finite type 
(signature) T (say, groups, semigroups, rings, etc.) Let us recall McKinsey's algorithm 
solving the word problem in G (see [41] , [37]). The word problem is divided into two parts. 
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Let F(X) be the free algebraic structure of type T freely generated by X. Then we define 
the "yes" and "no" parts of the word problem in G as follows: 

WP ycs = {{w,w') G F(X) | w = G w'} and WP no = {(w,w') € F(X) \ w ^ G w'}. 

To solve the word problem in G one runs in parallel two separate algorithms ^4 ycs and A no , 
such that starting with a given pair of elements w, w' G F(X) A ycs stops if and only if 
(w,w') G WP y es and A no stops if and only if (w,w') G WP no . 

The algorithm -4 yes enumerates one by one all consequences of the defining relations 
R and waits until w = w' appears in the list. 

The algorithm A no enumerates all homomorphisms 4>±, (f)2, ■ ■ ■ , of G into finite algebraic 
structures of type T and waits until (f>i(w) ^ 4>i(w'). 

Let now G be a finitely presented residually finite group. Although it seems like in 
general A ycs and A no are very slow, there were no examples of groups G for which these 
algorithms were actually very slow. More precisely, there were no known examples of finitely 
presented residually finite groups with very hard "yes" or "no" part of the word problem. 
Indeed, the most "common" residually finite groups are linear groups, say, over fields [37] . 
In that case it is well known that the "yes" part can be solved in deterministic polynomial 
time [35] El]. The "no" part can be solved by considering factor groups corresponding to 
ideals of finite index of some polynomial rings, hence also can be shown to be solvable in 
deterministic polynomial time. In fact the same can be said about most finitely presented 
groups (where "most" means "with overwhelming probability" in one of several probabilistic 
models): recent results of Agol [T] and Ollivier and Wise [47] together with the older result 
of Olshanskii |48| imply that most finitely presented groups are linear (even over Z). 

One of our main results is the following theorem (an immediate corollary of Theorem 
I4TT91 below): 

Theorem. Let f(n) be a recursive function. Then there exists a residually finite finitely 
presented solvable group G such that for any finite presentation {X;R) of G the time com- 
plexity of both "yes" and "no" parts of the word problem are at least as high as f(n). 

We also show that both algorithms A ycs and A no can be very slow even when both "yes" 
and "no" parts of the word problem are easy. 

Remark 1.2. Note that if we replace "finitely presented" assumption by "recursively pre- 
sented", then residually finite groups are known to be very complicated. As we have men- 
tioned before, recursively presented finitely generated residually finite groups may have 
undecidable word problem [39] H5] [21]). Even more, recently the second author and B. 
Khoussainov constructed residually finite Dehn monsters, i.e., infinite groups which are re- 
cursively presented, residually finite and algorithmically finite [33j . These are groups where 
the word problem is not only undecidable, but one cannot algorithmically enumerate an 
infinite set of pair-wise distinct elements of the group. 

Note also that although our groups are not linear they are (elementary Abelian)-by-linear 
since they are solvable of class 3 with the second derived subgroup elementary Abelian. 

1.3. Quantification of the "yes" part: the Dehn function. It was noticed by Madlener 
and Otto [40] that the Dehn function of a group measures the complexity of the word prob- 
lem. They also constructed finitely presented groups with arbitrary large Dehn functions. 
For residually finite groups, the situation is different. Nilpotent groups are examples of 
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residually finite groups with arbitrary high polynomial Dehn function [3]. The Baumslag- 
Solitar groups (x,y \ x y = x k ), k > 2, are examples of residually finite (even linear) groups 
with exponential Dehn function. No examples of residually finite groups with bigger Dehn 
functions were known. This gap is filled by the following 

Theorem 14. 181 For every recursive function f, there is a residually finite finitely presented 
solvable of class 3 group G with Dehn function greater than f . In addition, one can assume 
that the word problem in G is at least as hard as any given recursive function or as easy as 
polynomial time. 

As a corollary of Theorem 4.18 we mention the following exotic examples of groups. 

Corollary. For every recursive function f, there is a residually finite finitely presented 
solvable of class 3 group G with Dehn function greater than f and the word problem decidable 
in polynomial time. 

1.4. Quantification of the "no" part: the depth function. The function quantifying 
the algorithm „4 ycs is the depth function introduced by Bou-Rabee [9] . Recall that if G = (X) 
is a finitely generated group or semigroup, the depth function pc(n) is the smallest function 
such that every two words w w' of length at most n are separated by a homomorphism 
to a group (semigroup) H with \H\ < pq(ji). That function does not depend on the choice 
of finite generating set X (up to the natural equivalence). 

It is easy to see that for every finitely generated linear group or semigroup G, pc is 
at most polynomial. Since finitely generated metabelian groups are subgroups of direct 
products of linear groups [62] the depth function of every finitely generated metabelian 
group is at most polynomial. By the recent result of Agol jT] based on the earlier results of 
Wise |63j, every small cancelation group is a subgroup of a Right Angled Artin group, hence 
linear and has polynomial depth function. In fact one can have much smaller bounds for 

2 

many linear groups. For example, for the free group F2, PF 2 { n ) ls a t most n3 by a result of 
Kassabov and Matucci [26] . There are some finitely presented groups for which the depth 
function is unknown and very interesting. For example the ascending HNN extensions of 
free groups are known to be residually finite and even virtually residually nilpotent (proved 
by A. Borisov and the third author 0[8]) but the only upper bound one can deduce from the 
proof is exponential. Although many of these groups have small cancelation presentations 
and so covered by the results from pQ, there are some groups of this kind for which the 
depth function is not known. One of these groups is (x,y,t \ txt~ l = xy,tyt~ l = yx). 
The fact that it is hyperbolic follows from Bestvina- Feign combination theorem [6] and was 
proved by Minasyan (unpublished). If the depth function of that group is not polynomial, 
that group would not be linear, disproving a conjecture by Wise (he conjectured that all 
hyperbolic ascending HNN extensions of free groups are linear and, moreover, subgroups of 
Right Angled Artin groups). 

For finitely generated infinitely presented groups (even amenable ones) the situation is 
much more clear now. Using the method of Kassabov and Nikolov [27] and the result of 
Nikolov and Segal [36] one can construct a finitely generated residually finite group with 
arbitrary large recursive depth function. 

In this paper, we show that a similar result holds for finitely presented solvable of class 
3 groups. 

Theorem 14. 191 For every recursive function f , there is a residually finite finitely presented 
solvable of class 3 group G with depth function greater than f. In addition, one can assume 
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that the word problem in G is at least as hard as the membership problem in a given recursive 
set of natural numbers Z or as easy as polynomial time. 

As a corollary of Theorem 4.20 we mention the following exotic examples of groups. 

Corollary. For every recursive function f , there is a residually finite finitely presented 
solvable of class 3 group G with depth function greater than f and the word problem decidable 
in polynomial time. 

1.5. Distortion of pro-finitely closed subgroups of finitely presented groups. Let 

G be a group generated by a finite set X, H < G be a subgroup generated by a finite set 
Y. Recall that the distortion function fH,G( n ) is defined as the minimal number / such 
that every element of H represented as a word w of length < n in the alphabet X can be 
represented as a word of length < / in the alphabet Y [16] . It is clear [16] that the distortion 
function /gh is recursive if and only if the membership problem in H is decidable. 

Recall that H is closed in the pro-finite topology of G if H is the intersection of some 
subgroups of G of finite index. If G is finitely presented and H is closed in the pro- 
finite topology of G, then there exists a McKinsey-type algorithm A(G, H) solving the 
membership problem for H (and thus the fc,H is recursive). For every word w in the 
alphabet X, the "yes" part A yes (G,H) of the algorithm lists all words in Y, rewrites them 
as words in X, and then applies relations of G to check whether one of these words is 
equal to w. The "no" part A no (G, H) of the algorithm lists all homomorphisms <f> of G 
into finite groups and checks whether <j)(w) ^ <P{H). As in Section [1.21 one can asks what 
is the complexity of the "yes" and "no" parts of that algorithm, in particular, and of the 
membership problem for H in general. 

One can also quantify the complexity of the two parts A jes (G,H) and A no (G, H). The 
"yes" part is quantified by the distortion function fc,H{n) and the "no" part is quantified 
by the relative depth function PG,H(n) which is defined as the minimal number r such that 
for every word w of length < n in X which does not represent an element of H there exists 
a homomorphism (j) from G to a finite group of order < r such that 4>(w) <j){H). 

As for the word problem in residually finite finitely presented groups (discussed above), 
there were no examples of finitely generated subgroups of finitely presented groups that 
are closed in the pro-finite topology but have "arbitrary bad" distortion or "arbitrary bad" 
relative depth function. 

The Mikhailova's construction shows that finitely generated subgroups of the residually 
finite group F2 X F2 (here F2 is a free group of rank 2) could be as distorted as one pleases. In 
fact the set of possible distortion functions of subgroups of F2 x F2 coincides, up to a natural 
equivalence, with the set of Dehn functions of finitely presented groups [19]. By a result of 
Baumslag and Roseblade [5] subgroups of F2 x F2 are equalizers of pairs of homomorphisms 
4>: Ffc — > G, tp: F n — s> G (where F^^Fn are subgroups of F2), i.e. the subgroups of the form 
{(x,y) € -Ffc x F n j 4>{x) = ip(y)}. The equalizer subgroup is finitely generated if and only 
if G is finitely presented. It is easy to prove (see Lemma 15.21 below) that if G is residually 
finite, then the equalizer is closed in the pro-finite topology of F2 x F2. Thus we can use 
the examples of residually finite finitely presented groups with complicated word problem 
and complicated depth function to prove the following 

Theorem 15.51 For every recursive function f(n) there exists a finitely generated subgroup 
H < -F2 X i*2 that is closed in the pro-finite topology of F2 x F2 and whose distortion function 
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fF 2 xF 2 ,H, the relative depth function, and the time complexities of both "yes" and "no" parts 
of the membership problem are at least f(n). 

There is an analogous (though a bit weaker) result, for subgroups of a direct product 
Ss(X) x Ss(X), where Ss(X) is a free solvable group of class 3 with free generating set X. 

Theorem 15. 61 For any recursive function f (n) there is a finite set X and a finitely generated 
subgroup H G S3 (X) x S^X) such that E is closed in the pro-finite topology on Ss(X) x 
S-s(X) and the distortion function, the relative depth function, and the time complexities of 
both "yes" and "no" parts of the membership problem in H are at least f{ri). 

1.6. Methods of proof. As we have shown above (see Section 1 1 . 1 . 3 j) most versions of 
the Boone-Novikov construction ( [Ml ESI 121 EH] ) do not produce residually finite groups. 
Instead, we simulate Minsky machines in solvable groups of class 3. A similar construction 
is due to the first author [28]. We use a version from the unpublished thesis [29]. As in 
[281129], our group is a split extension of an elementary Abelian group of prime exponent by 
a metabelian group. Since every metabelian group has easy word problem and is residually 
finite, we can concentrate only on the elementary Abelian subgroup which is spanned, 
basically, by the configurations of the Minsky machine encoded in a certain way. That 
encoding is a very important feature of the construction. It helps us avoid the problem 
from Section [1.1.31 because the words w(u) corresponding to the input configurations u of 
the Minsky machines are not obtained by inserting copies of u in w($). 

Of course that construction also often leads to non-residually finite groups. But it turned 
out that the difficulty can be overcome by modifying the Minsky machine first. In this paper, 
we use the fact that every Turing machine and every Minsky machine with decidable halting 
problem is equivalent to a universally halting and even sym-universally halting machine 
(these machines can be characterized as the machines whose transition graphs are vertex 
disjoint unions of finite trees). 

1.7. Structure of the paper. The paper is organized as follows. Section [2] contains 
preliminary results about Turing and Minsky machines that are needed further. We show 
that one can modify any Turing or Minsky machine that recognizes a recursive set into a 
machine that halts on every configuration. In fact we can even assume that the symmetrized 
machine always halts (we call such machines sym-universally halting). 

In Section El we simulate sym-universally halting Minsky machines in residually finite 
finitely presented semigroups and prove the analogs of the above theorems for semigroups. 

In Section H] we simulate Minsky machines in solvable groups and construct complicated 
residually finite finitely presented groups. 

Sections E] and E] contain applications of the main theorems. In Section El we prove, in 
particular, Theorem 15.51 In Section El we strengthen the well-known result of Slobodskoi 
about undecidability of the universal theory of finite groups. We show, in particular, that 
the universal theory of any set of finite groups that contains all finite solvable groups of 
class 3 is undecidable. 

Acknowledgement. The authors are grateful to Jean-Camille Birget and Friedrich 
Otto for pointing to the references [13] , to Ben Steinberg for pointing to the reference [35] ,to 
Rostislav Grigorchuk for pointing to the references [151 [2T] and to Tim Riley for pointing to 
the references [19[ I20j. We are also grateful to Markus Lohrey and Ralph Strebel for their 
comments. 
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2. Turing machines and Minsky machines 

2.1. Turing machines. Let us give a definition of a Turing machine. A Turing machine M 
with K tapes consists of hardware (the tape alphabet A = U^ =1 Ai, and the state alphabet 
Q = U^ 1 Q.J3) and program P (the list of commands, defined below). A configuration of a 
Turing machine M is a word 

a\Uiq\Viu>i aiu-iqzviwi ■ ■ ■ o^kukQkvr^k 

(we included spaces to make the word more readable) where Ui , Vi are words in Ai, qi € Qi 
and oti,(jJi are special symbols (not from A U Q). 

A command simultaneously replaces subwords atqibi by words a\q^ i where ai,a\, are 
either letters from Ai U {a«} or empty, bi, b\ are either letters from Ai U {oJi} or empty. A 
command cannot insert or erase a, or cJi, so if, say, = ai, then a' i = ai. Note that with 
every command 9 one can consider the inverse command 9~ l which undoes what 9 does. 

A computation of M is a sequence of configurations and commands from P: 

IWi > W 2 — K . . — >■ 

Here / is called the length of the computation. We choose stop states q® in each Qi, then 
we can call a configuration w accepted if there exists a computation starting with w and 
ending with a configuration where all state symbols are q® and all tapes are empty. Also 
we choose start states q\ in each Qi. Then an input configuration corresponding to a word 
u over A\ is a configuration inp(it) of the form 

aiuq\uJi a 2 q\u2 ■ ■ ■ axq^K- 

We say that a word u over is accepted by M if the configuration inp(u) is accepted. The 
set of all words accepted by M is called the language accepted by M. 

The time function Tm (n) of M is the minimal function such that every accepted word of 
length < n has an accepting computation of length < Tm (n) ■ The space function Sm ( n ) of 
M is the minimal function such that every accepted word of length < n has an accepting 
computation where every configuration has length < Sm(it>). 

A Turing machine M is called deterministic if for every configuration, there exists at 
most one command from the program P that applies to this configuration. 

In this paper, we shall consider several types of machines. A machine M in general has an 
alphabet and a set of words in that alphabet called configurations. It also has a finite set of 
commands. Each command is a partial injective transformation of the set of configurations. 
A computation is a sequence 

0i ft, 9, 
wi — >w 2 — > ... — > w i+1 . 

where Wj are configurations, 9\, 9 n are commands and 6i(wi) = u>j+i for every i = 1, n. 
A machine is called deterministic if the domains of its commands are disjoint. A machine 
usually has a distinguished stop configuration, and a set / = I(M) of input configurations. 
A configuration is called accepted by M if there exists a computation connecting that con- 
figuration with the stop configuration. The machine Sym(M) is defined in the natural way 
(add the inverses of all commands of M). Two configurations w, w' are called equivalent, 
written w =m w ' \ if there exists a computation of Sym(M) connecting these configurations. 
Clearly, =m is an equivalence relation. 



U denotes disjoint union 
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The following general lemma is easy but useful. 

Lemma 2.1. Suppose that M is deterministic. Then two configurations w,w' of M are 
equivalent if and only if there exists two computations of M connecting w,w' with the same 
configuration w" of M . 

Proof. Indeed, since M is deterministic, in any computation of Sym(M) where no command 
is followed by its inverse inverses of command of M cannot be followed by commands of M. 
Thus the computation is a concatenation of two (possibly empty) parts: the first part uses 
only commands of M, the second part uses only inverses of commands of M. □ 

We say that a set X of natural numbers is enumerated by a machine M if there exists a 
recursive encoding fi of natural numbers by input configurations of M such that a number 
u belongs to X if and only if fi(u) is accepted by M. The set X is recognized by M if 
M enumerates X and for every input configuration every computation starting with that 
configuration eventually halts (arrives to a configuration to which no command of M is 
applicable). 

We say that machine M 1 polynomially reduces to a machine M if there exists an poly- 
nomial time algorithm A checking equivalence of configurations of M' which uses an oracle 
checking equivalence of configurations of M such that 

• Any computation of A verifying equivalence of configurations c, d of M' involves at 
most polynomial (in terms of |c| + |c'| ) number of uses of the oracle, 

• and every time the sizes of the configurations of M whose equivalence the oracle 
should check are polynomially bounded in terms of |c| + \c'\. 

We say that M and M' are polynomially equivalent if there are polynomial reductions of 
M to M' and vice versa. 

2.2. Universally halting Turing machines. A (not necessarily deterministic) machine 
M is called universally halting if for every configuration w of M there exist only finitely 
many computations of M starting with w without repeated configurations. 

We call a deterministic machine M sym-universally halting if Sym(M) halts if it starts 
with any non-accepted configuration. 

Theorem 2.2 (See, for example, [13]). For every recursive set X of natural numbers, 
that is accepted by a deterministic Turing machine M there exists a universally halting 
deterministic Turing machine M' with one tape accepting X and polynomially equivalent to 
M. 

Lemma 2.3. Let M be a deterministic sym-universally halting Turing machine. Then there 
exists a one-tape deterministic sym-universally halting Turing machine M' recognizing the 
same language as M . The machine M' is polynomially equivalent to M . 

Proof. The proof is by inspection of the proof from [T7]. □ 

Theorem 2.4. For every recursive set of natural numbers X there exists a sym-universally 
halting Turing machine M" with one tape that recognizes X . The machine M" satisfies the 
following conditions. 

(a) For every configuration c of M" either c is equivalent to an input configuration or 
every computation o/Sym(M") starting with c has length at most 0{\c\). 

(b) If c, d are two distinct input configurations of M" such that c =m" c ' ■ Then either 
c = d or both c,d are accepted by M" . 
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(c) If M is any Turing machine recognizing X then we can assume that M" polynomially 
reduces to M. 

Proof. Let M be a deterministic universally halting Turing machine with K tapes recogniz- 
ing L. Consider a new Turing machine M' constructed as follows. M' has one more tape 
than M', called the history tape. Its alphabet A' is in one-to-one correspondence with the 
set of commands P of M: A' = {[9], 9 € P}. Its state alphabet consists of two letters Qj(+i 
and {"m. With every command 9 of M we associate the command 9' of M' . It does what 
9 would do on the first k tapes of M' and inserts [9] on the history tape of M. After the 
first K tapes of M' form an accept configuration, the machine erases the history tape and 
stops (turns q l K+l into the stop state q° K+l ). Let P' be the program of M' . Now modify 
M' further to obtain a new Turing machine M" . The program P" of M' contains a copy 
P (the set of the main commands) of P' and some new, auxiliary, commands. After each 
main command of P, M" executes the history written on the history tape backward, with- 
out erasing the history tape: it just scans the tape from left to right, reading the symbols 
written there one by one and executing on the first K tapes the inverses of the commands 
written on the history tape. The commands that do that will be called auxiliary. If at the 
end of the scanning the history tape, it reaches an input configuration, M" executes on 
the first K tapes the history written on the history tape in the natural order (scanning the 
history tape from right to left). After that M" is ready to execute the next main command. 
We do not give precise definition of the program of M" because it is obvious on the one 
hand and long on the other hand. Clearly, the state alphabet of M" must be bigger than the 
state alphabet of M'. The machine M" is deterministic, universally halting, and recognizes 
the same language L. 

Let us prove properties (a) and (b) of the theorem. Since M" is deterministic, every 
reduced (i.e. without mutually inverse consecutive commands) computation O of Sym(M") 
is of the form BiO^" 1 for some computations Gi, 02 of M" (because a command of (M")" 1 
cannot be followed by its inverse). 

Let us show that Sym(M") halts when it starts with any non-accepted configuration (and 
then apply Lemma [2.3p . Let w be a configuration of M" that is not accepted by M". Since 
M" is deterministic, every computation of Sym(M") starting at w is a concatenation of a 
computation of M" followed by a computation of (M") _1 (i.e. the machine M" where every 
command is replaced by its inverse). Since M is universally halting, there are only finitely 
many computations of M" starting with w. Thus we only need to show that there are 
finitely many computations of {M")~ l starting with w, or, equivalently, that there are only 
finitely many computations of M" ending with w. Suppose that there are infinitely many 
computations of M" ending with w. Then, by definition of M" there must exist infinitely 
many input configuration inp(it) of M" for which there exists a computation of M" starting 
with inp(u) and ending at w. But that is impossible because such inp(u) is unique and is 
obtained by applying the inverse of the history written on the history tape of M" to w. 

(c) The fact that M" polynomially reduces to M is proved as follows. Consider two 
configurations w, w' of M". If w is not equivalent to an input configuration, then by (b) we 
need to check only whether w' is one of O(|io|) words that belong to the longest computation 
of Sym(M") containing w. That can be done in polynomial time without using the oracle 
checking equivalence of configurations of M. Suppose that both w and w' are equivalent 
to input configurations u,u' of M" . Then we can find u,u' in polynomial time and their 
lengths at at most 0(|"U>| + \w'\). If u 7^ u' and either u or u' is not accepted, then by (b) 
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w is not equivalent to w. If u = u' , then w is equivalent to u. Thus w is equivalent to 
w' if and only if u and v are accepted. To check that u is accepted, we need to remove 
letters corresponding to the extra tape from u producing a configuration u\ of M and check 
whether u\ is accepted, i.e. whether u\ is equivalent to the stop word of M. This can be 
done by asking the oracle once. Thus to check whether w and w' are equivalent we only 
need polynomial time and asking the oracle about equivalence of two pairs of configurations, 
the lengths of which are bounded by \w\ + \w'\. Thus M" polynomially reduces to M. □ 

2.3. Minsky machines. The hardware of a iT-glass Minsky machine, K > 2, consists of K 
glasses containing coins. We assume that these glasses are of infinite height. The machine 
can add a coin to a glass, and remove a coin from a glass (provided the glass is not empty). 
The commands of a Minsky machine are numbered. So a configuration of a -fT-glass Minsky 
machine is a K + 1-tuple (i; e\, . . . , ex) where i is the number of command that is to be 
executed, ej is the number of coins in the glass #j. 

More precisely, a command has one of the following forms: 

• Put a coin in each of the glasses ...,n/ and go to command ff j. We shall 
encode this command as 

«;->■ Add(ni, j 

where i is the number of the command; 

• If the glasses ##m, ni are not empty then take a coin from each of these glasses 
and go to instruction # j. This command is encoded as 

i;e ni > 0,...,e ni > ->■ Sub(m, ...,ni);j; 

• If glasses ##ni,...,nj are empty, then go to instruction # j. This command is 
encoded as 

i-e ni = 0, ...,e n; = -> j; 

• Stop. This command is encoded as i; -4 0; 

Remark 2.5. This defines deterministic Minsky machines. We will also need non-deterministic 
Minsky machines. Those will have two or more commands with the same number. 

Theorem 2.6. Let X be a recursively enumerable set of natural numbers. Then the follow- 
ing holds: 

(a) there exists a 2- glass deterministic Minsky machine MM 2 which recognizes L in 
the following sense: MM 2 begins its work in configuration (l;2 m ,0) and stops in 
configuration (0; 0, 0) if and only if m € X , and it works forever ifm^X. 

(b) There exists a 3-glass Minsky machine MM% which when started on a configuration 
(1; m, 0, . . . , 0) stops in the configuration (qo, 0, 0, . . . , 0) provided m £ X, and works 
forever otherwise. 

(c) We can also assume that every computation of MM2 or MM 3 starting with a con- 
figuration c empties each glass after at most 0(\c\) steps. 

(d) If X is recursive, then the machine MM 3 above can be chosen to be sym-universally 
halting. 

(e) If M is a deterministic Turing machine recognizing X, then we can assume that 
MM 2 (resp. MM 3) polynomially reduces to M where the numbers written on the 
tapes of M are measured as represented in unary (that is the size of a number n is 
set as n and not log 2 n). 
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Proof. The proof of the 2-glass part can be found in |36j . Let us prove the 3-glass part 
of the theorem. Let M be a one tape deterministic Turing machine M recognizing L. 
Without loss of generality we can assume that the tape alphabet of M is {1, 2}. For every 
configuration auqivu of M we can view u and v as numbers written in 3-ary, where v is read 
from right to left. Let us denote these numbers by l(u),r(v)v. For example the numbers 
corresponding to the configuration aYl\Tl\q^\TXlLO are l(u) = 1212213 and r(v) = 222I3, 
both written as 3-ary numbers. Thus with the configuration auqiVto, we associate the 
following configuration of a 3-tape Minsky machine (i;l(u),r(v),0). Now every command 
of a Turing machine can be simulated by a series of commands of the Minsky machine. 
For example, the command of the form l(/j2 — > qjl is interpreted by a sequence M{9) of 
commands of MM 3 as follows. The commands of M{9) will be numbered i.l through i.l for 
some I. The commands from M{9) should replace l(u) coins in the first glass by |^( u )/3j 
coins provided l(u) = 1 mod 3 and replace r{v) coins in the second glass by 3|_?"(u)/3j + 1 
coins provided r(u) = 2 mod 3. The first part is done as follows. Decrease the number of 
coins in the first glass by 3 simultaneously increasing the number on the third glass by 1. 
Do that until the number of coins in the first glass is less than 3. If that remaining number 
is 1, then subtract 1 coin from the first glass, and then keep adding 1 coin to the first glass, 
removing 1 coin from the third glass until the third glass is empty. If the remainder is not 
1, then keep removing one coin from the third glass while adding 3 coins to the first glass - 
until the third glass is empty (i.e. return to the original configuration because the command 
9 is not applicable). The second part is done in a similar manner by using the second and 
third glasses of the Minsky machines. Other commands of the Turing machine are treated 
in the same manner. Let MM3 be the resulting 3-tape Minsky machine. It is easy to see 
that if the Turing machine M is sym-universally halting, then the Minsky machine MM3 is 
sym-universally halting. This gives properties (a),(b) and (d) of the theorem. 

To ensure Property (c), we can do the following. Note that after every series of commands 
M{9) the configuration of MM 2 or MM 3 has (at least) one empty glass. After the series 
of commands M{9) of MM 2 or MM3 corresponding to a command 9 of M is executed, 
that glass is again empty. So before MM 2 or MM3 execute the next series M{9') we force 
it to move all coins from each of the non-empty glasses to the empty one and back. In 
the process, it will empty each glass at least once. Clearly, this modification increases the 
length of computation by an amount proportional to the length of configuration of MM2 
or MM 3 . 

Finally property (e) is obtained as follows. Suppose that w, w' are two configurations of 
MM 3 (for MM 2 the proof is similar). By construction (see [36J) in at most O(|io|) steps 
of MM3 either w turns into a configuration corresponding to a configuration of the Turing 
machine M or MM3 halts. In the latter case, we check whether w is equivalent to w' 
in 0(|ui|) steps. So we can assume that both w and w' are equivalent to configurations 
corresponding to configurations u,u' of the Turing machine M whose lengths are 0(|u>| + 
\w'\). Now w is equivalent to w' if and only if u and v! are equivalent configurations of M. 
Thus we need to use the oracle once. □ 



3. Simulation of Minsky machines by semigroups 

3.1. The construction. Here we will show how to simulate a Minsky machine by a semi- 
group. All applications of Minsky machines are based on the following idea. 
First, with every configuration i/j one associates a word (term) w(i/j). 
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Then with every command k of the Minsky machine M one associates a finite set of 
defining relations R K . The algebraic structure A(M) will be defined by the relations from 
the union R of all R K (which is finite since we have only a finite number of commands) and 
usually some other relations Q which are in a sense "independent" of R. We need Q, for 
example, to make sure A(M) satisfies a particular identity. 

We say that the algebra A(M) simulates M if the following holds for arbitrary configu- 
rations ipi,ip2 of M: 

(1) tpi =m if and only if ui{ip\) = w(ip2) in A(M). 

Usually, in order to prove the property ([I]) one has to prove the following two lemmas. 

Lemma 3.1. If a configuration c' can be obtained from a configuration c by a command n 
of M then the word w(c') can be obtained from the word w(c) by applying defining relations 
of A(M) from the set R K . 

Lemma 3.2. // a word w(c') can be obtained from a word w(c) by applying the defining 
relations of A{M) then c =m c' . 

It is easy to see that Lemmas 13.11 and 13.21 imply property ([I]). 

There is an easy way to interpret Minsky machines in a semigroup S(M). Let M be a 
Minsky machine with K glasses and commands ##1, 2, N, 0. Then S(M) is generated 
by the elements qo, . . . , qjy and {a,, Aj, i = 1, K}. The set of defining relations of S(M) 
consists of all commutativity relations 

(2) CLiCLj = Ojdj, QiAj = Ajdi, AiAj = AjAi, i ^ j, 
which we shall call commutativity relations, the stop relation 

(3) qo = (i.e. q$x = xqo = qo for every generator x), 

all relations of the form xy = where xy is a two-letter word which is not a subword of a 
word of the form fta^ 1 ...a 6 ^ A\...Ak modulo the commutativity relations ([2]), (for example 
qiqj = Aiai = aiqj = Aiqj = 0), which we shall call 0-relations, and relations associated 
with commands of M according to the following table, 



(4) 



Command of M 


Relation of S(M) 


i Add(ni, ...,n m ); j 


Qi — QjQ"ni-"Q"nm 


i,e ni > 0, ...,e„ m > -)■ Sub(n 1; n m ); j 


Qi a ni ■ ■ ■ a n m = Qj 


i,e ni = 0, ...,e„ m =0^j 


QiA ni . . . A nm = qjA ni . . . A rim 



These will be called the Minsky relations. 

The words in S(M) corresponding to configurations of M are the following: 

w(i;ei, ...,e K ) = qia^ ...a e £ A X ...A K . 
The proof that Lemmas 13.11 and 13.21 hold in S(M) follows easily from Lemma 12. 1[ see 
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Lemma 3.3. Suppose that a word W is not in S{M), is a subword of w(i; e±, ex) (up 
to the commutativity relations and does not contain either qi or one of the Aj. Then 
there are at most 0(\W\) different (up to the commutativity relations) words that are equal 
to W in S(M). All these words are subwords of words of the form w(i'; e\, ex) such that 
the configurations (i; e\, ex) and (%' , e[, e' K ) of M are equivalent. 

Proof. Since W ^ in S(M), the stop relations do not apply to W or to any word that is 
equal to W in S(M). If W does not contain qi, then the only relations that apply to W are 
the commutativity relations, so the only words that are equal to W in S(M) are the words 
obtained from W by the use of commutativity relations. 

Suppose that W contains qi but does not contain one of the Aj. 

Without loss of generality, we can assume that W contains every letter from w(i; e\ , . . . , ex) 
except some of the -Aj's. 

Every application of the Minsky relation to W corresponds to a command of the Minsky 
machine, applied to the configuration c = (i; e\, ex)- Let c = c\ — > C2 — > ... be any 
computation of Sym(M) starting with c. Then the sequence of commands of M applied 
in that computation has the form 9\ . . . Q n ®^X\ ■ ■ ■ O^ 1 where 6 S are commands of M (by 
Lemma 12. ip . If this sequence can be applied to W, then this computation never checks 
whether glass Jfj is empty. By Property (c) of Theorem 12. 6[ both n and k must be at most 
0(| W\). This implies the statement of the lemma. □ 

3.2. Residually finite finitely presented semigroups. 

Lemma 3.4. Every non-zero element of S(M) is represented by a subword of a word of the 
form w(i;ei, ...,e K )- 

Proof. This follows from the commutativity relations and O-relations. □ 

Lemma 3.5. Suppose that the Minsky machine M is sym-universally halting. Then 

(a) Every non-zero element z of S(M) has finitely many divisors, i.e. elements y such 
that z = pyq for some p,q G S(M) U {1}. 

(b) For every configuration tp of M the word w(i/j) is equal to in S(M) if and only if 
is accepted by M . 

Proof, (a) This follows from Lemmas 12.11 13.11 and 13.21 for words containing a ^-letter and 
all letters Aj, and from Lemma 13.31 in all other cases since word in the generators of S(M) 
that is non-zero in S(M) is a subword of one of the words corresponding to configurations 
of the Minsky machine M by Lemma 13.41 

(b) This follows from Lemmas 13.11 and 13.21 □ 

Lemma 13.51 immediately implies 

Lemma 3.6. For every R > let Vr be the set of all elements of S(M) that do not divide 
in S(M) non-zero elements represented by words of the form qja^ 1 . . . a e £ A\ . . . Ax with 
Ej < R. Then Vr is an ideal of S{M) with a finite complement. If M is sym-universally 
halting, then the intersection of all Vr,R > 0, is {0}. 

Theorem 3.7. For every recursive set of natural numbers Z there exists a residually finite 
semigroup S whose word problem is at least as hard as the membership problem in Z. The 
Dehn function of S is equivalent to the time function of a 3- glass Minsky machine recognizing 
Z. 
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Proof. By Theorem 12.41 there exists a sym-universally halting Turing machine that rec- 
ognizes Z. By Theorem 12.61 there exists a sym-universally halting Minsky machine M 
recognizing Z. By Lemma 13.51 the problem of recognizing equality to in S(M) is at least 
as hard as the membership problem in Z. By Lemma 13.61 S(M) is residually finite. □ 

3.3. Residually finite semigroups with large depth function. Recall the definition 
of the depth function p: for every finitely generated residually finite universal algebra A and 
every number n, PA(n) is defined as the smallest number such that for every two different 
elements z, z' in A of length < n there exists a homomorphism from A onto a finite 
algebra B of cardinality at most PA{n) such that 4>(z) ^ 0(n'). 
The following lemma is well known |22j 

Lemma 3.8. Suppose that every non-zero element of a semigroup S with has finitely 
many divisors. Then S is residually finite. 

Proof. Indeed, the set of all non-divisors of a non-zero element is an ideal with finite quo- 
tient. The intersection of all these ideals is {0}. □ 

Theorem 3.9. For every recursive function f there exists a finitely presented residually 
finite semigroup S such that ps(n) > fin) for all n. In addition, we can assume that the 
word problem in S is as hard as the membership problem for any prescribed recursive set of 
natural numbers. 

Proof. Let M be a sym-universally halting Minsky machine with K glasses and N +1 com- 
mands numbered 0, . . . , N . Consider the following new, non-deterministic Minsky machine 
M n . Its hardware consists of the K glasses of M plus two more glasses. In every command 
of M we add the instruction to add a coin to glass K + 1 provided glass K + 2 is empty. 
Also for every i = 0, N we add two new commands number i 

(5) i;Add(K + 1,K + 2) -H 
and 

(6) i;e K +i = 0,e K +2 = 0, -> 

Thus there will be three commands for each i = 1, . . . , N: one from M and the two new 
ones. The new command ([5]) allows us to add, at any step of the computation, equal (but 
arbitrary) number of coins in glasses K + 1 and K + 2, and if both glasses K + 1 and K + 2 
are empty, the computation can stop. But we can execute a command of M only when the 
glass K + 2 is empty, so a new command cannot be followed by a command of M. 

Let us say that the commands coming from M have weight 1 and new commands ([5]) , ([6]) 
have weight 0. The weight of a computation is then the sum of the weights of all commands 
used in the computation. We also define the weight of a configuration as the number of 
coins in the first K+ 1 glasses minus the number of coins in glass K + 2. Every computation 
C of Sym(M n ) projects onto a computation 7r(C) of Sym(M): we simply forget the extra 
two glasses and the new commands. The weight of C is equal to the length of 7r(C). The 
numbers of coins used in C and 7r(C) in the first K glasses are the same, the number of 
coins in glass K + 1 in the last configuration W of C minus the number of coins in glass 
K + 2 of W is equal to the weight of C. 
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Also any computation C of M lifts to (possibly infinitely many) computations C n of M n , 
the weight of each C n is the same as the length of C, and the number of coins used in the 
first K glasses is the same. 

Note that Lemma 12. II still holds for M n even though M n is non-deterministic. It can be 
easily established by using the projection n. 

This implies that if M is sym-universally halting, then for every configuration W of M n 
the weights of all computations C without repeated configurations of Sym(M n ) are bounded, 
the number of coins in the first K glasses of M n used during any of these computations is 
bounded, and the weights of configurations appearing in these computations are bounded. 

Consider the semigroup S{M n ). Every non-zero element w in S(M n ) is represented by a 
word of the form u{w)v{w) where 

u(w) = <£°af . . . a 1 - AT . . . A^ Mw) = a l £X\a l ££A a ™ 

where ctj £ {0, > 0. Note that if two non-zero words w, w' are equal in S(M n ), then 
u(w) and u(w') are equal in S(M). 

We claim that S(M n ) is residually finite. Indeed, consider two words w\,W2 in the 
generators of S(M n ) which are not equal in S(M n ), does not divide w\ in S(M n ) (clearly 
Wi and W2 cannot divide each other without being equal in S{M n )). 

Suppose first that w\ does not contain a g-letter. Then consider the ideal Q of S{M n ) 
generated by all g-letters. The inequality w\ ^ W2 survives in the Rees factor-semigroup 
S(M n )/Q. But in S(M n )/Q every element has finitely many divisors, hence S(M n )/C is 
residually finite by Lemma f3.8| and so we can separate w\ and W2 by a homomorphism onto 
a finite semigroup. 

Thus we can assume that w\ starts with a g-letter qi. Suppose that u(w\) ^ u{w2) in 
S(M). Adding the relation a 2 K+l = ort+i, a^ +2 — a< K+2 t° S{M n ) we then obtain a new 
semigroup S{M n ) and a homomorphism 4>: S{M n ) —¥ S{M n ) which separates w\ and W2- 
In the semigroup S(M n ), every non-zero element has finitely many divisors since it is true 
for S(M) and the number of different elements of the form v(w) is finite. Hence S(M n ) is 
residually finite by Lemma 13.81 

Thus we can assume that u(wi) = u{w2)- Let 

v(wt) = a™\ ia ™\ 2 A^A^, D = max{|/ x+1 - l K+2 \ + 1, \m K +i ~ m K+2 \ + 1}. 

Let us add the relations a^ +1 = a|^_ l5 a^ +2 = ^° S{M n ). Let S(M n ) be the resulting 
semigroup, and ijj: S{M n ) —¥ S{M n ) be the corresponding homomorphism. Then it is easy 
to see that ip(wi) ^ ijj{w2)- Since in S(M n ), every element has finite number of divisors 
(the same argument as for S(M n )), we can again use Lemma 13.81 

The function p{n) for the semigroup S(M n ) is at least as large as the following function 
ty(n) associated with the machine M: $?(n) is the smallest number such that for every non- 
accepted input configuration of M of length < n, the machine M halts after at most Vl/(ra) 
steps (i.e. the co-time function of M). Indeed let c be an input configuration of length at 
most n such that M halts after exactly ty(n) steps starting at c. Suppose that the word w(c) 
in S(M n ) corresponding to the configuration u can be separated from in a homomorphic 
image E of S{M n ) with at most ^(n) — 1 elements. Then the images of a^+i, clk+2 in that 
semigroup satisfy z D = z 2D for some D < T(n). Since the halting computation has > D 
steps, the letter ax+i occurs in w(u) exactly once, and every command of M n corresponding 
to a command of M adds one coin in glass K + 1, there exists a word W which is equal to 
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w(u) in S(M n ) and which has the form 

qja l l . . . a^Ai . . . A K (Ik +1 Ak+iAk+2- 
Modulo relations corresponding to the commands ([5]), this word is equal to 

qja^ . . . a£A\ . . . A K a 2 I ^ +1 A K+1 aK+2^-K+2- 
The image of the latter word in E is equal to 

qja 1 ^ . . . a£A\ . . . A K aK +1 A K+1 a^ +2 A K+2 
which, again modulo the relations corresponding to the commands ©, is equal to 

qjd l l . . . a^Ai . . . A K A K+1 A K+2 

which is equal to by the relations corresponding to the commands (0), a contradiction. 

Note that the co-time function of a Turing machine recognizing a recursive set can be 
larger than any given recursive function. Indeed, after the machine halts without accepting, 
we can make it work as long as we like. It remain to note that the co-time function of a 
Minsky machine simulating that Turing machine cannot be smaller. □ 

4. Simulation of Minsky machines in solvable groups 

Recall that a variety of algebraic structures is a class of all algebraic structures of a 
given type (signature) satisfying a given set of identities (also called laws). Equivalently, 
by a theorem of Birkhoff [36] a variety is a class of algebraic structures closed under taking 
cartesian products, homomorphic images and substructures. Every variety contains free 
objects (called relatively free algebraic structures). One can define algebraic structures that 
are finitely presented in a variety as factor-structures by congruence relations generated 
by finite number of equalities. Every finitely presented algebraic structure which belongs 
to a variety V is finitely presented inside V but the converse is very rarely true. See [32] 
for a survey of algorithmic problems for varieties of different algebraic structures (mostly 
semigroups, groups, associative and Lie algebras). In this section we concentrate on varieties 
of groups. The most well known varieties are the variety of Abelian groups A given by the 
identity [x,y] = 1, the variety of nilpotent groups of class c, N c given by the identity 
[...[xi, X2], ...,x c+ i] = 1, etc. The class of Abelian groups of finite exponent d, Ad, is also a 
variety, given by two identities [x, y] = 1, x = 1. 

If IA and V are two varieties of groups then the class of groups consisting of extensions of 
groups from U by groups from V is again a variety (the product oili and V) denoted by UV. 
The product of varieties is associative [45]. For example the variety of all solvable groups 
of class c is the product of c copies of the variety A. If V is a variety of groups, then ZV 
is the variety consisting of all central extensions of groups from V. For example A/2 = ZA 
and, more generally, N c +\ = ZM C for every c > 1. 

The problem of finding a finitely presented group with undecidable word problem, be- 
longing to a proper variety of groups (i.e. satisfying a non-trivial identity) was formulated 
by Adian [M] and solved by the first author in [28]. The construction was simplified in 
the unpublished dissertation [29] . In this section, we shall modify the construction from 
[29] to construct residually finite finitely presented solvable groups with complicated word 
problem. 
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4.1. The construction. Let M be a Minsky machine with K glasses and N + l commands 
(numbered 0, N). We are going to construct a group G(M) simulating M. The group 
G(M) will be in a sense similar to the semigroup S(M) constructed above. The main idea 
will be to replace the product by another operation and make sure that with respect to the 
new operation the semigroup S(M) "embeds" into our group. 

Thus the group will be generated by the g-letters which will be related to the letters 
from S(M), and also a-letters a±,...,aK, ^4- letters, A±,...,Ak and some extra a- and A- 
letters that help us impose the necessary commutativity relations that, in particular, make 
the group solvable. The group we are going to construct will be a semidirect product of 
the Abelian normal subgroup generated by the (/-letters by the semidirect product of an 
Abelian subgroup generated by ^4-letters and an Abelian subgroup generated by a-letters. 
Thus we should have a way to ensure that in a subgroup generated by two sets of letters 
Z L)Y, the normal subgroup generated by Z is Abelian. This is done with the help of the 
following lemma due to Baumslag [3] and Remeslennikov [52]. In that lemma we denote 
u a = a~ 1 ua and u a+b = u a u b (note that although u a+b is not necessarily equal to u b+a , the 
equality will hold if the normal subgroup generated by u is Abelian, which is going to be 
the case every time we apply this lemma). 

Lemma 4.1 ([31 [52]). Suppose that a group H is generated by three sets X, F = {ai | i = 
1, . . . , m}, F' = {a[ | i = 1, . . . , m} such that 

(1) The subgroup generated by F U F' is Abelian; 

(2) For every a G F and every x G X we have x^ a ^ = x a for some monic polynomial f 
of a which has at least two terms (in all our applications f(t) = t — 1); 

a a l a am 

(3) [x^ m , X2} = 1, for every x\, X2 G X, and every «i, a m € {0, 1, — 1}. 

Then the normal subgroup generated by X in the group H = (X U F U F') is Abelian, and 
H is metabelian. 

If the elements Gtj and a\ and the set X satisfy the conditions of Lemma 14.11 we will call 
a[, i = 1, ...,m, are BR-conjoints to a. L with respect to X (and the polynomial /). 

Consider the free commutative monoid generated by letters Aq, ...,Ak- Let Uo be the set 
of all divisors of the element AqAi...Ak in that monoid, and U be the set of all symbols 
qjiv, w G Uo,j = 0, N. Also fix a prime p (say, p = 2). 

The generating set of our group G = G(M) will consists of three subsets: 

L = {x u ,u G U, i = 0, N}; 

L x = {A u i = 0, ...,#,}; 

L 2 = {ai, a-, a,i, d'i, i = 1, . . . , K}. 

We introduce notation for some subgroups of the group G. Denote Hi = (Li),i = 0, 1, 2. 
Denote also 

M = {aj,5-,A),2 = 1, ... ,K}, Mi = {ai,a'i,Ai},i = 1,...,K 
The group G{M) has the following set of defining relations: 

Gl. Relations saying that Hq and Hi are Abelian groups of exponent p, and H 2 is an 
Abelian group. 

G2. Any y G Mi, z G Mj, i / j G {0, K}, commute. 

G3. For every i = 1, . . . ,K, (a[)~ l is a BR-conjoint to a" 1 with respect to {Ai} (and 
polynomial f(t) = t — 1). 
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G4. The elements of the set {(aQ _1 ,i = 1, ...,K} are a BR-conjoints to elements of the 
set {a~ l ,i = 1,...,K} with respect to {Aq}. 

G5. a) If u G U does not contain A; L for some i = 0,...,K, then [x u , Aj\ = x uAi , 
j = 0,...,K. 

b) For every i = 1, . . . , K , if u does not contain Ai, then x"'" 1 = x"' (see notation before 
Lemma 14. 1\ 

c) For every i = 0, . . . , K, if u contains ^4j, z € Mj, then [x u , z] = 1. 
G6. xg = 4 = 4, i = 0, iV, i = 1, . . . , if. 

G7. [x*, x„] = 1, where z = a" 1 . . . , a^, € {—1, 0, 1}. 

Remark 4.2. Relations G7 together with Gl and G5b) imply that for every subset I C 
{1, . . . ,K} the letters {a^i G 7} are BR-conjoints of {a,,i G 7} with respect to the set of 
all XuS where u does not contain letters Ai,i € I. 

G8. Relations constructed from the program of the machine M. For every / G G denote 
also let 

f*Ai = [f,Ai},i = 0,...,K. 
We denote (...(ii * £2) * •••) * tk by ii * • • • * ifc, and ix * *2 * ••■ * *2 by £1 * t\ n ■ The relations 

corresponding to the commands of M are in the following table. 



(7) 



Command of M 


Relation of C(M) 


i -> Add(ni, . . . ,n m ); j 


x cjiAo = X <?,A * a ni * ••• * a n m 


i,e m > 0, . . . ,e„ m > ->■ Sub(m, . . . ,n m );j 


X qi A * Q*ni * ■ ■ ■ * a n m = x qjA 


i,e m = 0, ...,e nm = -> j 


X qi A * A ni * ... * A Um 

= x g, Ao * A11 * • • • * A nm 



Theorem 4.3. (a) The group G(M) belongs to A 2 p An ZM K +i-A. 

(b) The equality 

x qiAo * aS mi) * . . . * A™ * ... * A™ = x gjAo * af ni) * ... * * * . . . A^ k) 

where eti,f3i G {0, 1} is true in G{M) if and only if the equality 

Qi a^ . . . a™ K A? ...A°* = Qj a^ . . . a™ K A^ . . . A** 

is true in the semigroup S(M) (in particular, ai = for every i). 

(c) The equality 

x qi * af mi) * . . . a^ K) * A[ ai) * ... * A% k) = x qj * a^ l] * ... * o^ k) * A^ *... A^ k) 
where cti,(3i G {0, 1} is true in G(M) if and only if rm = rii,a,i = (3i for every i. 
Proof. First we will prove part (a): G(M) G A 2 p A n ZAf 3 A. 

Lemma 4.4. The subgroup {Hi U H2) of G is metabelian and a semidirect product of the 
Abelian normal subgroup H^ 2 of exponent p, and H2. 



ALGORITHMIC ALLY COMPLEX RESIDUALLY FINITE GROUPS 



21 



Proof. Indeed by relations G2, 

K 

(Mi,i = Q,...,K) = Y[{ai, a't, Ai) x {di,d'i, A ,i = 1,...,K). 
i=i 

Using relations Gl, G3, G4, we can apply Lemma 14.11 to each of the factors in that direct 
product and conclude that each of them is metabelian and a semidirect product of the 
Abelian of exponent p normal subgroup generated by the intersection of {Ai, i = 0, . . . , K} 
with that factor, and the Abelian group generated by the a-letters from that factor. □ 

Lemma 4.5. The normal subgroup T of G generated by all the elements x u ,u E U, is 
Abelian of exponent p. 

Proof. Relations G5 a) of the group G imply that every element x u ,u E U is a product of 
elements x z q . , z € Hi, i = 0, . . . , N. Therefore, it is enough to show that 

(8) Xq k X qt = X qt Xq k 

for any z E (Hi,H2) and any k,t. To reduce the proof of these equalities to the proof 
of more simple equalities notice that z = ZqZi ■ ■ ■ zk where zi E (Mi) by G2. Therefore 
equalities ([8]) are equivalent to 

(Q\ rr^r-ZX—ZK — rf.Zl---ZKrr.ZO 

v y / ^qk^qt ~ x qt x qk- 

a p (a'.) q 

We can represent element x Zi ., i > 1, as a product of elements of the form x q ^ 1 and 
x q]A- ■ Indeed we have the following sequence of equalities deduced using G2, G5, G6: 



(10) 



a ri (a'.) s iA tl ...a rs (a l .) s kA tk G6 a r i (a') s i A*i ...a k (a') Sf = 
X qj X q . 

Hi 

G5 a), c), G 6 {a'.)^ ...a? (qQ'* A** , tl tf? (a^l _ 

Repeating this argument K times, one proves that x q ^ j Z2 '" ZK can be represented as a product 
of elements of the form Xu where u E U, y € Hi. A similar proof (using also G4) gives that 
x z ° is a product of elements of that form. It remains to note that elements of the form Xu, 
u E U, y E H-2 commute by Remark 14.21 and Lemma 14.11 □ 



Remark 4.6. Note that equalities (|10p and similar equalities when x qj is replaced by x u , 
u E U, imply the following: if y is a product of elements of the form a 1 - 1 (a'-) Sl Ai and 
J2i r i = s i = 0> then [x u , y] is 1 if u contains Ai or a product of conjugates of elements 
%uAi by elements from (di) x (a-) otherwise. Similarly, suppose that y is a product of 
elements from Mq, each factor containing Aq, and the total exponent of every di (resp. a'-) 
is 0. Then = 1 provided u contains Aq and is a product of conjugates of x u a by 

elements from (ai,a'-) provided u does not contain Aq. 



By construction, the group G is a semidirect product of T and the metabelian group 



H^ 2 x Hi- By Lemma 14.41 G is solvable of class 3 and, moreover, belongs to AlA 
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Remark 4.7. The proof of Lemma 14.51 shows that T is generated (as an Abelian group) by 
elements of the form Xu where u G U and y G H 2 . 

Lemma 4.8. The quotient of G(M) over the center satisfies the identity 

[[xi,yi},[x 2 ,y 2 ], ■ ■ ■ ,[xk+2,VK+2]] = 1. 

This means that G belongs to the variety ZNk+\A. 

Proof. Let P be the derived subgroup of G(M). By Lemma 14.51 every element of P is a 
product of an element of T and an element of Iff 1 2 . It also follows from Lemma 14.51 that 
[P, P] C T, hence by Remark 14.71 it is generated by elements of the form x y u , u G U,y G H 2 , 
the word u contains at least one Ai, i = 0,...,K. Since T is Abelian, the subgroup 
[P, P, . . . , P] is generated by the commutators 



K+2 



r y ih.2,1 h h 2 , K] 
L x u; "-1,1 1 n \,K \ 



for some € Hi, y, h 2 ^ G H 2 . An easy induction shows that every such commutator is a 
conjugate of 

(11) [x u ,hl 1 ,...,hl K ] 
where y' G H 2 . 

Let h G Hi, u G U, y G H 2 . Suppose that h = ■ ... ■ A\ s s where U ^ 0. Consider [x u , h v \. 

Then Remark 14.61 implies that [x u , h v ] is a product of elements of the form x y u , where u' G U 
contains letters A^ , A^ 3 and it may not be equal to 1 only if one of the letters A^. does not 
occur in u. Therefore the commutator (jlip is either equal to 1 or is a product of elements of 
the form x v u , where the word v! G U contains all letters Aq, Ai, ...,Ak, y" G H 2 . But every 
such x u t is in the center of G(M) by G5 c). Hence [P,...,P] is contained in the center of 

K+2 

G(M). □ 

We now prove (b) and (c). For this, as we mentioned before Lemma 13.11 we need to 
prove Lemmas 13.11 and 13.21 Lemma 13.11 for G(M) is proved in the same way as for the 
semigroup S(M) (see [551 [32] , since the only property of S(M) used there was that the word 
w = qia l i . . . a£ A^ 1 . . . A^f is equal to any word obtained from w by permuting ai with aj, 
Ai with Aj and ai with Aj (i ^ j). The same is true for words of the form 

(12) x qiAo * 4 mi) * . . . a£*> * 4 Ql) * • • • * A% K) 

in G{M) by the definition of the operation *, relations Gl, G2 and Lemma 14.51 

In order to prove Lemma 13.21 we will define a new group G that is a quotient of G and 

injective on elements of the form (|12p . 

Let S be the semigroup with the same generating set as S(M) subject all the relations of 

S(M) except the relations corresponding to the commands of M (that semigroup does 

not depend on M). Thus non-zero elements in M have the form 

n a iJi n lli A ai A aK 

Hi a i ■■■a K A 1 ... Ji K 

where L G N, aj G {0, 1}. Let W be the set of all non-zero elements of S containing a 
^-letter, and Wo be the set of elements from W viewed as elements of S(M) (i.e. different 
words may represent equal element) with Aq inserted next to the g-letter. Consider the 
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free Abelian group T\ of exponent p generated by the elements Zi 1)m „ t i K)U , u 6 W U Wo, 
ij € {1, 2, 3}. For each element of L\ U L2, we define an automorphism of T\. The group G 
will be the semidirect product of T± and the group generated by these automorphisms. 

For simplicity we will denote automorphisms corresponding to letters from L\ U Li by 
the same letters. 

Let us start with automorphisms a,, a'. We have to define zf' „ and 2..* „, for 
every £1, ...,£/<■. First suppose that u does not contain A,-. To simplify the notation we shall 
denote the vector (ii, ■ ■ ■ ,ix) by £, and the standard unit vectors by ei,l = 1,...,K. We 
shall write z* instead of Zi lt ... t i K)U . The j-th coordinate of i is denoted by ij. 



(13) 



l,U 







l,U 1- 






1 


z-> zZ 


l,U %- 




Z i~2e 3 


u 



ifij 
if 

if i; 



i; 

2: 
3. 



-1 a j 
i.u i.u 



If zi contains letter Aj, then let z- J 

J i.u 



Z^ J 
i,u 



i.u 



It is easy to prove that aj is an automorphism by constructing the automorphism a - 1 . 

If we apply aj 1 to the third equality in (|13p . we will obtain the formula for zJ provided 
ij = 1 (and u does not contain Aj). Plugging it in the second equality of (flUj) we obtain 



the formula for zJ provided z, 

i,u J 

obtain the formula for zJ provided ij = 3 



2. Finally plugging it in the first equality in (|13p . we 



Xf 

I.U 



-1 -1 -1 -r • o 

x^ x- , if ij = 3 

i—e.i,u i,u i,ua J 



X~? 1 — » - if % n 2 

i,u i+ej,u' J 



X i+2e ] ,ui ^ — 1- 



The automorphism cL is defined similarly. If zz contains Aq, then z 



l . u 



If u does 



a-; a-; 
= Z- . 

I.U I.U 



±0 ^ 3 

If u does not contain Aq but contains Aj, i.e. u = vAj for some v, then 



i,U j+ej,M i+2ej,ti i,va,jA 

z?„.z7 1 ^ , 



, if Zj = 1; 
if ij = 2; 
if ij = 3. 



a j -1 a i 
ZJ = 2- . 
i.u i.u i,u 



Finally the automorphisms corresponding to Aj, j = 0, ...,K, are defined as follows: 
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if u does not contain Aj and 



if u contains Aj . 



z^ J = z-> 

i.li t,U 



The following lemma is obtained by a straightforward application of the definition of the 
automorphisms above and the definition of the operation *. This lemma implies that G 
satisfies G8 if we replace x u by Zf (since the corresponding relations hold in S(M)). 

Lemma 4.9. The following relations hold in G. For every w £ {aj, Aj,j = 1, ...,K} 

Z U * w = z T,uw 

where we set z^ Q = 1 (the identity element in G(M)) where * is defined in G8. 

We define G as the semidirect product of T\ and the subgroup of Aut(Ti) generated by 
the automorphisms corresponding to the elements from L\\J L<i- From the definition of 
the automorphisms and Lemma 14.91 it follows that G is generated by the elements , 

u € U, where 1 is the vector (1,1,..., 1) and the automorphisms corresponding to elements 
of L\ U L2. It is easy to check that all the relations G1-G8 hold in G, therefore 

Lemma 4.10. The map that sends every a- or A-letter to itself and every x u to z^ u extends 
to a homomorphism <fi from G to G. 

Lemma 4.11. The homomorphism <f> is surjective. 

Proof. It is easy to see that we only need to define pre-images x^ of elements z^ u € G, 
w € W U Wq. By the definition of <f>, we have (fi(x u ) = Zf u for every u € U so we define 
The other preimages are defined by induction on the length of w and the sum 

of lj. 

Suppose w € W U Wo does not contain Aj and ij = 1, i! is arbitrary. Then we define: 

- x'^ 1 

— a J _1 
i+2e j,w i tW ' 

x~?i ^ a 1' — x^i > 
r ,w J %' ,waj 

We also have * Aj = x? A for any i. 

i,w J i,wAj J 

It is easy to see that for every i and w € W U Wo, we have (j*(x-* ) = z? . This proves 
the lemma. □ 

In G(M), consider the set P of elements 

(14) z tqiAo *a^K...a^UA^K...*A^ 
where cti € {0, 1} and the set Po of elements 

(15) z Ui * a[ mi) * . . . a ( ™ K) * A^ l] * ... * A% k) 

By construction P n Pq = 0, elements ([14p are different if and only if elements 



W\ ■■■a K 



A^ 1 A aK 
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from S(M) are different, and elements (|15[) are different if and only if the corresponding 
elements qia™ 1 . . . aTfA® 1 . . . A°^ of S are different. This completes the proof of Lemma 
EjZhnd Theorem 03] (b) , (c) . □ 

We shall need a few more properties of the group G(M). 

Lemma 4.12. Let elements x^ w ,w G W L) Wo from G be defined as in the proof of Lemma 

\JU\ Let y G L 1 U L 2 , w G W n W . Then for every i G {1, 2, 3}^'-'^>, x% satisfies the 

same equalities as elements zi from the definition of automorphism of G with z replaced 

by x everywhere. In particular, xH is a product of one or several elements of the form x-?, w , 

such that every letter a,- occurs in w' at least as many times as in w (in particular if for 
some R > 0, w belongs to the ideal Vr defined in Lemma \3.6\ then w' G Vr. 

Proof. For y G UMj,j > 1, this follows from the way x^ u are constructed. For y G Mq, one 
needs to use G2, G5 c), and G6. □ 

The proof of Lemma 14.121 actually gives the following 

Lemma 4.13. If v is a word in a- and A-letters (i.e. over L\ U Lz), then x% is a product 
in G of elements x^ as in Lemma where the length of each w does not exceed the 
length of v (hence the total number of different xj w occurring in this product is polynomial 
in terms of \v\. 

Lemma 4.14. The normal subgroup T generated by the elements x u ,u G U inG = G(M) is 
the direct product of cyclic subgroups generated by the elements x-> , i G {1, 2, S}''- 1 ''"'^, w G 

w u w . 

Proof. By Lemma 14.121 elements x^ w span T. We defined elements x^ w ,w G W in such 
a way that they are pre-images of the corresponding elements in G under (f>. Thus the 
elements x^ w ,i G {1, 2, S}^ 1 ''"'^^, w G W U Wq are linearly independent since their images 
under (j> are linearly independent in T\. □ 



Lemma 4.15. Let R > 0, Vr be the ideal of the semigroup S(M) defined in Lemma \3.b\ 
Then the subgroup T{Vr) ofT spanned by all the elements w G Vr,i G {1, 2, S}' 1 '" '^ 
is normal in G(M) (as before, we set x^ = 1) and of finite index in T. 

Proof. The first part follows from Lemma |4.12[ If {w\, ...,wm} is the set S(M) \ Vr, then 
{ X I w> * e ^' ^}^ 1 '"'' K \ W £ wm}} is a set of representatives of all cosets of T(Vr) 

in T. □ 



4.2. A finitely presented solvable group with undecidable word problem. By The- 
orem 12. 6( there exists a 2-glass Minsky machine which computes a non-recursive partial 
function. The corresponding group G(M) has undecidable word problem and belongs to 
the variety Az,A D ZM3A by Theorem 14.31 Hence we obtain the following: 

Theorem 4.16 (Kharlampovich [28]). There exists a finitely presented group with unde- 
cidable word problem that belongs to the variety A\A n ZN3A. 
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4.3. Residually finite finitely presented groups. 

Theorem 4.17. If a Minsky machine M is sym-universally halting then the group G(M) 
is residually finite. Its word problem is at least as hard as the halting problem for M . 

Proof. Let M be a sym-universally halting Minsky machine. Let w ^ 1 € G{M). We 
use the notation from the definition of G(M). There exists a natural homomorphism £ 
from G(M) to the metabelian group iif 2 x H2 which kills all elements from T. Since every 
finitely generated metabelian group is residually finite, we can assume that C{w) = 1. Hence 
w € T. By Lemma 14.141 x is a product of elements of the form 

(16) x, u , i € {1, 2, 3}^-' K \u € W U Wq. 

Hence w = wqW\ where wq (resp. w\) is a product of elements ([16]) with u € Wq (resp. W). 
Suppose that w\ is not 1. Let T' be the subgroup of G(M) generated by elements (fl~6|) with 
u € Wq. Then T is a normal subgroup of G{M) by LemmaSZGJ Let G'(M) = G(M)/T'. 
This group is a semidirect product of T/T' and the metabelian group iff 2 x fli. Let L> be 
the sum of lengths of words u £ W that appear in the factors of w\. Let Yd be the set of 
all words in S where at least one a-letter appears at least D times, and 0. Then Yd is an 
ideal in S, and the image of the set of elements (j 16j) with u £ Yd in G'(M) form a normal 
iV subgroup of G'(M) of finite index (because T is an Abelian group of finite exponent 
p). That normal subgroup does not contain w by Theorem 14.31 (c). Then G'(M)/N is a 
semidirect product of a finite group and the metabelian group iff 2 x if 2 . Hence G'{M)/N 
is residually finite and w can be separated from 1 by a homomorphism from G{M) onto a 
finite group. 

Finally suppose that w\ = 1. Let ui, u; be the elements from Wq that appear in the 
representation of w as a product of elements (I16p . Let £7 be the set of words that is equal 
to one of the Uj in S(M). Since M is sym-universally halting, E is finite. Let D be the 
maximal length of a word in E. Let, as above, Yd be the ideal in S consisting of and all 
elements where one of the a-letters appears at least D times. Let Zd be the set of non-zero 
elements of S(M) that are images of words from Yd under the natural homomorphism 
S —> S(M). Then Zd does not contain u±, ...,ui. Consider the subgroup F of T spanned 
by all elements (116[> with u G Zd U Yd- From Lemma 14.121 it follows that F is a normal 
subgroup of G(M) of finite index in T. Since Zd does not contain u±, ui, the subgroup 
-F does not contain w. The factor-group G{M)/F is a semidirect product of a finite group 
and the metabelian group iff 2 x ff 2) and we can complete the proof as above. □ 

Theorem 4.18. For every recursive function f , there is a residually finite finitely presented 
solvable of class 3 group G with Dehn function greater than f . In addition, one can assume 
that the word problem in G is at least as hard as the membership problem in a given recursive 
set of natural numbers Z or as easy as polynomial time. 

Proof. The statement follows from Theorems 14.171 and 13.71 □ 

4.4. Residually finite finitely presented group with large depth function. 

Theorem 4.19. For every recursive function f there exists a finitely presented residually 
finite group G from ApA n ZM3A such that pc( n ) > f( n ) for all n. In addition, we can 
assume that the word problem in G is as hard as the membership problem for any prescribed 
recursive set of natural numbers. 
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Proof. Consider the Minsky machine M n constructed in the proof of Theorem 13.91 Then 
as in the proof of Theorem 14.181 one can prove that G(M n ) is residually finite. The fact 
that pc( n ) > f( n ) is proved the same way as in the proof of Theorem 13.91 (one only needs 
to replace the product by operation * everywhere in that proof). □ 

5. Distortion of subgroups closed in the pro-finite topology 
Let us generalize the Mikhailova construction |43j . 

Let G be a finitely generated group generated by a finite set X, N < G a normal subgroup, 
generated as a normal subgroup by a finite set R = {rr, . . . , r^.}, and <j) '■ G — > G/N the 
canonical epimorphism. We may assume that both sets X and R are symmetric, i.e., 
X = X" 1 and R = J? -1 . The set 

E(G, N) = {(«, v) G G x G | 0(tt) = 0(f)} 

is a subgroup of G x G, called the equalizer of (</>, 0). 

In the following lemma we summarize the main components of Mikhailova's argument 
(though in a much more general situation). The proof is easy and we leave it to the reader. 

Lemma 5.1. In the notation above the following hold: 

• E(G, N) is generated by a finite set 

D = {(r, 1) | r G R} U {{x, x' 1 ) \ x G X} C G x G. 

• For any w G G if (w, 1) = (ui, Ui)(«2, ^2) ••• K, v n ) /or some (uj, vi) G -D, f/ien 
u\ . . .u n is of the form WsriWirzW? ■ ■ ■ w m -\r m w m for some Wi G G, n G R, m < n, 
satisfying wowi . . . w m = 1 in G, hence, 

m 

W = G JJ(^0 • • • Wi-l)n{w Q . . . Wi-i)" 1 . 
i=l 

In particular, the distortion of E(G, N) in G x G is at least as high as the Dehn 
function of G/N relative to N. 

Let V be a class of finite groups closed under direct products and subgroups. Recall that 
the pro-P topology on a group G has as its base the set of all normal subgroups N with 
G/N G V. 

Lemma 5.2. Let V be a class of finite groups closed under direct products and subgroups. 
In the notation above if the group G/N is residually V then the subgroup E(G,N) is closed 
in the pro-V topology on G x G. 

Proof. Suppose (u,v) G G x G but (u,v) £ E(G, N), so <f>(u) ± <p(v). Since G/N is 
residually V there is a homomorphism 77 : G/N — > K onto a finite group K G V such that 
ij(p(u) ^ il<ft(v) in K. Therefore the image of the pair (u, v) under r)<f> is not in the image 
of the subgroup E(G, N) in K x K. Hence the subgroup E(G, N) is closed in the pro-"P 
topology on G x G. □ 

The same argument gives the following 

Lemma 5.3. Under the assumptions of Lemma \5.Si the relative depth function Pe{g,n) ^ s 
at least as large as the depth function of G/N, the time complexities of the "yes" and "no" 
parts of the membership problem for E(G, N) are as high as the time complexities of the 
"yes" and "no" parts of the word problem in G/N. 
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Lemma 15.31 and Theorem 14.181 imply 

Remark 5.4. The converse of Lemma 15.21 also holds, namely, if E(G,N) is closed in the 
pro-T-'-topology, then G/N is residually V. We are not using this remark below so we leave 
it as an (easy) exercise. 

Theorem 5.5. For any recursive function f(n) there is a finitely generated subgroup H < 
F2 x F2 such that H is closed in the pro-finite topology on F2 x F2 and has distortion at 
least f{n). 

Proof. Let G = (X | R) be a finitely presented residually finite group with Dehn function at 
least f(n) from Theorem 14.181 If N is the normal closure of R in F(X) then the subgroup 
H = E(F(X), N) < F(X) x F(X) satisfies all the requirements of the theorem. 

Now one can embed the free group F(X) into F2 in such a way that the pro-finite topology 
induced on the image of F(X) from F2 is precisely the pro-finite topology on F(X). Indeed, 
there is a finite index subgroup H of F2 of rank \X\, the induced topology on H is the pro- 
finite topology on H . It follows that the pro-finite topology on the subgroup F\x\ of F2 is 
precisely the topology induced by the pro- finite topology from F2, as required. □ 

Applying the same argument to the free solvable groups S3 (X) of class 3 and generating 
set X one gets the following result. 

Theorem 5.6. For any recursive function f(n) there is a finite set X and a finitely gen- 
erated subgroup H < S^lX) x Ss(X) such that E is closed in the pro-finite topology on 
Ss(X) x S3 (X) and has distortion function, relative depth function, the time complexities 
of both "yes" and "no" parts of the membership problem and at least f{n). 

6. Universal theories of sets of finite solvable groups 

In this section we will prove the following result. For the class of all finite groups in was 
proved by Slobodskoi |60| (the idea of Slobodskoi's proof came from Gurevich's paper |23| 
where the same result was proved for semigroups). 

Theorem 6.1. The universal theories of the class of finite groups from ApAn ZN5A and 
the class of all periodic groups are recursively inseparable. In particular, the universal theory 
of any set of finite groups containing all finite solvable of class 3 groups is undecidable. 

Proof. It is well known |23] that there exists a Turing machine for which the set of input 
configurations accepted by the machine and the set of input configurations starting with 
which the machine never stops are recursively inseparable. Let M be a 2-glass Minsky 
machine with the same property. 

Consider the 4-glass Minsky machine M n described in the proof of Theorem 13.91 Let 
S'(M n ) be the semigroup given by the same defining relations as S(M n ) except the relation 
<7o = is substituted by the relation qiA^A^ = for every i. It does not affect the proof of 
Theorem 14.31 

Let G'(M n ) be the group corresponding to S'(M n ) in the same way G(M n ) corresponds to 
S(M n ). Then G'{M n ) belongs to A 2 p A n ZN§A and simulates M n as described in Theorem 
14.31 Let R be the (finite) set of defining relations of G'(M n ). Let X be the set of numbers 
e such that M n accepts the configuration (e,0, 1,0). Let X' be the set of numbers e such 
that M n works infinitely long starting with the configuration (e, 0,1,0). Then X and X' 
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are recursively inseparable by the choice of M and M n . For any configuration (e,0, 1,0) of 
M n consider the corresponding element 

w(e) = qi * di * A\ * A2 * 03 * A3 * A4. 

Suppose £Gl, Then there are only finite number of computations of Sym(M n ) starting 
at the configuration (1; e, 0, 1, 0). Then as in the proof of Theorem l4.19| there exists a finite 
homomorphic image H of G'(M n ) where the image of w(e) is not equal to 1. Hence the 
universal formula &i? — >• w(e) = 1 does not hold in the finite group H from ApA PI ZM5A. 

Now suppose that e X. Consider any periodic homomorphic image H of G'(M n ). Let 
t be the image of t € G'(M n ) in H. Then there exists a number D such that for every 
element x € T, 

f ,-\ -(D) -(2D) 

(17) x * a 3 = x * a 3 . 

Since M n works infinitely long starting at the configuration (e, 0,1,0), by Theorem 14.31 
the following equality is true for some i,k\,k2- 

w(e) = Xf A * * a^ 2 ^ * * A\ * A2 * A3 * A4. 



'l,qiA 

Then by (|17p and Theorem 14.3 



_(fci) -(2D) -(D) . 

w(e) = Xj* , A * a\ * a 2 * 03 * a\ * A\ * A% * As * A4 

.(An) ^ _(fc 2 ) 



X lq A * a l fel ^ * * a 3 ^ * a 4 * Ai * A2 * A3 * A4 



= x? „ a * a) * a\ * A\ * Ao * A3 * A4 = 1. 

since g^s^U = in S'(M n ). Hence the universal formula IkR —¥ w(e) = 1 holds in H. 

Thus the set of universal formulas &zR — > w(e) = 1 that do not hold in some finite group 
from ApA fl ZN5A and the set of such formulas which hold in every periodic group are 
recursively inseparable. 

□ 

Remark 6.2. Note that the universal theory of finite metabelian groups is decidable |32j . 
The same is true for the set of finite groups (and any other algebraic structures of finite type) 
of any finitely based variety where every finitely generated group is residually finite [32] . 
On the other hand, the universal theory of all finite nilpotent groups is undecidable |30| . 
The description of all (finitely based) varieties of groups where the universal theory of finite 
groups is decidable is currently out of reach. From Zelmanov's solution of the restricted 
Burnside problem \65\ [66] , it immediately follows that the universal theory of finite groups 
in every finitely based periodic variety of groups is decidable. That result and simulations 
of Minsky machines in semigroups (as in Section [3]) were used by the third author |56] 
to obtain a complete description of all finitely based varieties of semigroups where finite 
semigroups have decidable universal theory. For more information on that problem, see 
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